Robust clusterwise linear regression through trimming
Abstract
The presence of clusters in a data set is sometimes due to relations among the measured variables that vary depending on some hidden factors. In these cases, observations can be grouped in a natural way around linear and nonlinear structures and, thus, the problem of robust clustering around linear affine subspaces has recently been tackled through the minimization of a trimmed sum of orthogonal residuals. This "orthogonal approach" implies that no variable plays the privileged role of response or output. However, there are problems where one variable clearly needs to be explained in terms of the others, and the use of vertical residuals from classical linear regression seems more advisable. The so-called TCLUST methodology is extended to perform robust clusterwise linear regression, and a feasible algorithm for its practical implementation is proposed. The algorithm includes a "second trimming" step aimed at diminishing the effect of leverage points. © 2009 Elsevier B.V. All rights reserved.
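The idea of trimming on vertical residuals can be illustrated with a minimal sketch. This is not the algorithm proposed in the paper: it assumes a plain alternating scheme (fit least squares per cluster, reassign each point to the closest fit, discard the fraction `alpha` of points with the largest squared vertical residuals), and it leaves out both the TCLUST-type constraints and the "second trimming" step. The function names `fit_line` and `trimmed_clusterwise_regression`, and all parameters, are hypothetical.

```python
import numpy as np


def fit_line(X, y):
    """Least-squares coefficients (intercept first) for one cluster."""
    A = np.column_stack([np.ones(len(X)), X])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    return beta


def trimmed_clusterwise_regression(X, y, k, alpha, n_iter=50, seed=0):
    """Illustrative sketch only; label -1 marks a trimmed observation."""
    rng = np.random.default_rng(seed)
    n = len(y)
    n_trim = int(np.floor(alpha * n))          # number of points to discard
    A = np.column_stack([np.ones(n), X])       # design matrix with intercept
    labels = rng.integers(0, k, size=n)        # random initial partition
    betas = np.zeros((k, A.shape[1]))
    for _ in range(n_iter):
        # Refit each cluster that has enough points for a regression.
        for j in range(k):
            members = labels == j
            if members.sum() >= A.shape[1]:
                betas[j] = fit_line(X[members], y[members])
        # Squared vertical residual of every point to every fitted model.
        res2 = (y[:, None] - A @ betas.T) ** 2
        new_labels = res2.argmin(axis=1)
        best = res2.min(axis=1)
        # Trim the n_trim points that are worst explained by any model.
        if n_trim > 0:
            new_labels[np.argsort(best)[n - n_trim:]] = -1
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return betas, labels
```

A single random start like this can stop in a poor local solution, so in practice one would likely run several starts and keep the one with the smallest trimmed sum of squared residuals.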
Similar resources
Fuzzy clusterwise linear regression analysis with symmetrical fuzzy output variable
The traditional regression analysis is usually applied to homogeneous observations. However, there are several real situations where the observations are not homogeneous. In these cases, by utilizing the traditional regression, we have a loss of performance in fitting terms. Then, for improving the goodness of fit, it is more suitable to apply the so-called clusterwise regression analysis. The ...
Regularized fuzzy clusterwise ridge regression
Fuzzy clusterwise regression has been a useful method for investigating cluster-level heterogeneity of observations based on linear regression. This method integrates fuzzy clustering and ordinary least-squares regression, thereby enabling to estimate regression coefficients for each cluster and fuzzy cluster memberships of observations simultaneously. In practice, however, fuzzy clusterwise re...
Clusterwise PLS regression on a stochastic process
In this paper we propose to use the PLS approach for clusterwise linear regression in the particular case where the set of predictor variables forms an L2-continuous stochastic process {Xt}t∈[0,T]. We have adapted the k-means algorithm to this case and we give necessary conditions for its convergence. The results of an application of the clusterwise PLS regression to stock-exchange data are comp...
Robust nonparametric kernel regression estimator
In the robust nonparametric kernel regression context, we prescribe a method to select the trimming parameter and bandwidth. By solving estimating equations, we control the effect of outliers through a combination of weighting and trimming. We show asymptotic consistency, establish bias and variance properties, and derive asymptotics. © 2016 Elsevier B.V. All rights reserved.
PCR and PLS for Clusterwise Regression on Functional Data
Clusterwise regression is applied to functional data, using PCR and PLS as regularization methods for the functional linear regression model. We compare these two approaches on simulated data as well as on stock-exchange data.
Journal title:
- Computational Statistics & Data Analysis
Volume 54
Publication date: 2010